On classification between normal and pathological voices using the MEEI-kayPENTAX database: issues and consequences
Abstract
A large body of research in pathological voice classification considers the task of feature extraction for discriminating between normal and dysphonic sustained vowels. The most widely used dataset for this purpose is the Massachusetts Eye & Ear Infirmary (MEEI) Voice Disorders Database commercialized by KayPENTAX Corp. During the last two decades, dozens of methods have been proposed to extract discriminative features from these signals in order to design accurate classifiers between the two classes of this database. The main contribution of this paper is to show that the normal and dysphonic sustained vowels of the KayPENTAX database are actually perfectly separable. This implies that this dataset is not suited for the normal-vs-dysphonic classification task, as long as the only concern is to achieve high classification accuracy. Indeed, we show that a single scalar parameter extracted from a matching pursuit decomposition of these signals (with a Gabor dictionary) yields perfect classification accuracy (100% with a large margin). We then discuss the implications of this finding for the precautions that should be taken with this database and for research in pathological voice detection in general.
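For readers unfamiliar with the technique named in the abstract, the following is a minimal sketch of matching pursuit over a small Gabor dictionary, using NumPy. The abstract does not state which scalar parameter the authors extract, so `residual_energy_ratio` is only a hypothetical placeholder, and the function names, the dictionary grid, and the atom parameterization are all assumptions made for illustration.

```python
import numpy as np

def gabor_atom(n, center, freq, scale):
    """Unit-norm real Gabor atom: Gaussian window times a cosine.
    center and scale are in samples, freq is in cycles per sample."""
    t = np.arange(n)
    g = np.exp(-0.5 * ((t - center) / scale) ** 2) * np.cos(2 * np.pi * freq * (t - center))
    return g / np.linalg.norm(g)

def build_dictionary(n, centers, freqs, scales):
    """Stack atoms over a (hypothetical) grid; result has shape (n_atoms, n)."""
    atoms = [gabor_atom(n, c, f, s) for s in scales for c in centers for f in freqs]
    return np.stack(atoms)

def matching_pursuit(x, D, n_iter):
    """Greedy decomposition: at each step pick the atom most correlated
    with the residual and subtract its projection."""
    residual = np.asarray(x, dtype=float).copy()
    picks = []
    for _ in range(n_iter):
        corr = D @ residual                 # inner products with all atoms
        k = int(np.argmax(np.abs(corr)))    # best-matching atom index
        picks.append((k, float(corr[k])))
        residual -= corr[k] * D[k]          # atoms are unit norm
    return picks, residual

def residual_energy_ratio(x, residual):
    """Placeholder scalar (an assumption, not the paper's parameter):
    fraction of signal energy left unexplained."""
    x = np.asarray(x, dtype=float)
    return float(np.sum(residual ** 2) / np.sum(x ** 2))
```

A scalar of this kind could then be thresholded to separate two classes, though whether that matches the authors' parameter cannot be determined from the abstract alone.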
Similar resources
On the Use of the Correlation between Acoustic Descriptors for the Normal/Pathological Voices Discrimination
This paper presents an analysis system aiming at discriminating between normal and pathological voices. Compared to the literature on voice pathology assessment, it is characterised by two aspects. First, the system is based on features inspired from voice pathology assessment and music information retrieval. Second, the distinction between normal and pathological voices is simply based on the correl...
Novel VTEO Based Mel Cepstral Features for Classification of Normal and Pathological Voices
In this paper, novel Variable length Teager Energy Operator (VTEO) based Mel cepstral features, viz., VTMFCC, are proposed for automatic classification of normal and pathological voices. Experiments have been carried out using this proposed feature set, MFCC and their score-level fusion. Classification was performed using a 2nd-order polynomial classifier on a subset of the MEEI database. The equa...
Acoustic analysis of normal Saudi adult voices.
OBJECTIVE: To determine the acoustic differences between Saudi adult male and female voices, and to compare the acoustic variables of the Multidimensional Voice Program (MDVP) obtained from North American adults to a group of Saudi males and females. METHODS: A cross-sectional survey of normal adult male and female voices was conducted at King Abdulaziz University Hospital, Riyadh, Kingdom of S...
Normal and Organic Pathology Classification of Female Voices Using SVM Classifier
In this paper, we propose to classify normal and pathological female voices and, above all, to classify organic female voice pathologies, namely edema and nodule pathologies. Besides, we propose to study the effect of the fundamental frequency and the open quotient parameters composing the feature vector on the performance rates in addition to the MFCC and ...
Voice pathology detection and classification using MPEG-7 audio low-level features
In this paper, a new pathological voice detection and pathology classification method based on MPEG-7 audio low-level features is proposed. MPEG-7 features are originally used for multimedia indexing, which includes both video and audio. Indexing is related to event detection, and as pathological voice is a separate event from normal voice, we show that MPEG-7 audio low-level features can do ver...